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A method and arrangement for providing security through network address 
translations using tunneling and compensations 



The invention relates in general to the field of secure communications between 
computers in a packet-switched data transmission networks. More particularly the 
invention relates to the field of setting up and maintaining secure communication 
connections through a Network Address Translation or protocol conversion. 

The Internet Engineering Task Force (IETF) has standardized the IPSEC (Internet 
Protocol Security) protocol suite; the standards are well known from the Request 
For Comments or RFC documents number RFC2401, RFC2402, RFC2406, 
RFC2407, RFC2408 and RFC2409 mentioned in the appended list of references. 
The IPSEC protocols provide security for the IP or Internet Protocol, which itself 
has been specified in the RFC document number RFC791. IPSEC performs 
authentication and encryption on packet level by generating a new IP header, adding 
an Authentication Header (AH) or Encapsulating Security Payload (ESP) header in 
front of the packet. The original packet is cryptographically authenticated and 
optionally encrypted. The method used to authenticate and possibly encrypt a packet 
is identified by a security parameter index (SPI) value stored in the AH and ESP 
headers. The RFC document number RFC2401 specifies a transport mode and a 
tunnelling mode for packets; the present invention is applicable regardless of which 
of these modes is used. 

In recent years, more and more vendors and Internet service providers have started 
performing network address translation (NAT). References to NAT are found at 
least in the RFC document number RFC 1631 as well as the documents which are 
identified in the appended list of references as Srisuresh98Terminology, 
SrisureshEgevang98, Srisuresh98Security, HoldregeSrisuresh99, TYS99, 
Rekhter99, LoBorella99 and BorellaLo99. There are two main forms of address 
translation, illustrated schematically in Figs, la and lb: host NAT 101 and port 
NAT 15 1. Host NAT 101 only translates the IP addresses in an incoming packet 102 
so that an outgoing packet 103 has a different IP address. Port NAT 151 also 
touches the TCP and UDP port numbers (Traffic Control Protocol; User Datagram 
Protocol) in an incoming packet 152, multiplexing several IP addresses to a single 
IP address in an outgoing packet 153 and correpondingly demultiplexing a single IP 
address into several IP addresses for packets travelling in the opposite direction (not 
shown). Port NATs are especially common in the home and small office 
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environment. The physical separation of input and output connections for the NAT 
devices is only shown in Figs, la and lb for graphical clarity; in practice there are 
many possible ways for physically connecting a NAT. 

Address translation is most frequently performed at the edge of a local network (i.e., 
translation between multiple local private addresses on on hand and fewer globally 
routable public addresses on the other). Most often, port NAT is used and there is 
only one globally routable address. A local network 154 has been schematically 
illustrated in Fig. lb. Such arrangements are becoming extremely commonplace in 
the home and small office markets. Some Internet service providers have also started 
giving private addresses to their customers, and perform address translation in their 
core networks for such addresses. In general, network address translation has been 
widely discussed in depth e.g. in the NAT working group within the Internet 
Engineering Task Force. The operating principles of a NAT device are well known, 
and there are many implementations available on the market from multiple vendors, 
including several implementations in freely available source code. The typical 
operation of a NAT may be described so that it maps IP address and port 
combinations to different IP address and port combinations. The mapping will 
remain constant for the duration of a network connection, but may change (slowly) 
with time. In practice, the NAT functionality is often integrated into a firewall or a 
router. 

Fig. lc illustrates an exemplary practical network communication situation where a 
transmitting node 181 is located in a first local area network (also known as the first 
private network) 182, which has a port NAT 183 to connect is to a wide-area 
general packet-switched network 184 like the Internet. The latter consists of a very 
large number of nodes interconnected in an arbitrary way. A receiving node 185 is 
located in a second local area network 186 which is again coupled to the wide-area 
network through a NAT 187. The denominations "transmitting node" and "receiving 
node" are somewhat misleading, since the communication required to set up 
network security services is bidirectional. The transmitting node is the one that 
initiates the communication. Also the terms "Initiator" and "Responder" are used for 
the transmitting node and the receiving node respectively. 

The purpose of Fig. lc is to emphasize the fact that the communicating nodes are 
aware of neither the number or nature of the intermediate devices through which 
they communicate nor the nature of transformations that take place. In addition to 
NATs, there are other types of devices on the Internet that may legally modify 
packets as they are transmitted. A typical example is a protocol converter, whose 
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main job is to convert the packet to a different protocol without disturbing normal 
operation. Using them leads to problems very similar to the NAT case. A fairly 
simple but important example is converting between IPv4 and IPv6, which are 
different versions of the Internet Protocol. Such converters will be extremely 
important and commonplace in the near future. A packet may undergo several 
conversions of this type during its travel, and it is possible that the endpoints of the 
communication actually use a different protocol. Like NAT, protocol conversion is 
often performed in routers and firewalls. 

It is well known in the IPSEC community that the IPSEC protocol does not work 
well across network address translations. The problem has been discussed at least in 
the references given as HoldregeSrisuresh99 and Rekhter99. 

In the Finnish patent application number 974665 and the corresponding PCT 
application number FI98/01032, which are incorporated herein by reference, we 
have presented a certain method for performing IPSEC address translations and a 
method for packet authentication that is insensitive to address transformations and 
protocol conversions en route of the packet. Additionally in said applications we 
have presented a transnutting network device and a receiving network device that 
are able to take advantage of the aforementioned method. However, some problems 
related to the provision of network security services over network address 
translation remain unsolved in said previous patent applications. 

It is an object of the present invention to present a method and the corresponding 
devices for providing network network security services over network address 
translation in a reliable and advantageous way. 

According to a first aspect of the invention there is therefore provided a method for 
securely communicating packets between a first computer device and a second 
computer device through a packet-switched data transmission network comprising 
intermediate computer devices, where at least one of said computer devices 
performs a network address translation and/or a protocol conversion, the method 
comprising the steps of 

- determining what network address translations, if any, occur on packets 
transmitted between the first computer device and the second computer device, 

- taking packets conforming to a first protocol and encapsulating them into packets 
conforming to a second protocol, which second protocol is capable of traversing 
network address translations, 
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- transmitting said packets conforming to said second protocol from the first 
computer device to the second computer device and 

- decapsulating said transmitted packets conforming to said second protocol into 
packets conforming to said first protocol. 

5 According to a second aspect of the invention there is provided a method for 
conditionally setting up a secure communication connection between a first 
computer device and a second computer device through a packet-switched data 
transmission network comprising intermediate computer devices, where at least one 
of said computer devices performs a network address translation and/or a protocol 
10 conversion, the method comprising the steps of 

- finding out, whether or not the second computer device supports a communication 
method where: it is determined what network address translations, if any, occur on 
packets transmitted between the first computer device and the second computer 
device; packets are taken that conform to a first protocol and encapsulated into 

15 packets that conform to a second protocol, which second protocol is capable of 
traversing network address translations; said packets conforming to said second 
protocol are transmitted from the first computer device to the second computer 
device; and said transmitted packets conforming to said second protocol are 
decapsulated into packets conforming to said first protocol, 

20 - as a response to a finding indicating that the second computer device supports said 
communication method, setting up a secure communication connection between the 
first computer device and the second computer device in which communication 
connection said communication method is employed and 

- as a response to a finding indicating that the second computer device does not 
25 support said communication method, disabling the use of said communication 

method between the first and the second computer devices. 

According to a third aspect of the invention there is provided a method for 
tunnelling packets between a first computer device and a second computer device 
through a packet-switched data transmission network comprising intermediate 
30 computer devices, where at least one of said computer devices performs a network 
address translation and/or a protocol conversion, the method comprising the steps of 

- taking packets conforming to a first protocol and encapsulating them at the first 
computer device into packets conforming to a second protocol, which second 
protocol is capable of traversing network address translations, 
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- transmitting said packets conforming to said second protocol from the first 
computer device to the second computer device, 

- decapsulating said transmitted packets conforming to said second protocol into 
packets conforming to said first protocol at the second computer device, 

- generating response packets conforming to said first protocol and encapsulating 
them at the second computer device into response packets conforming to said 
second protocol, 

- transmitting said response packets conforming to said second protocol from the 
second computer device to the first computer device, 

- decapsulating said transmitted response packets conforming to said second 
protocol into packets conforming to said first protocol at the first second computer 
device, 

- using the response packets at the first computer device to obtain information about 
the address translations occurred on packets transmitted between the first computer 
device and the second computer device and 

- using said obtained information to modify the operation of the tunnelling of 
packets between the first computer device and the second computer device. 

According to a fourth aspect of the invention there is provided a method for 
turmelling packets between a first computer device and a second computer device 
through a packet-switched data transmission network comprising intermediate 
computer devices, in which data transmission network there exists a security 
protocol comprising a key management connection that employs a specific packet 
format for key management packets, the method comprising the steps of 

- encapsulating data packets that are not key management packets into said specific 
packet format for key management packets, 

- transmitting said data packets encapsulated into the specific packet format from 
the first computer device to the second computer device, 

- discriminating at the second computer device the data packets encapsulated into 
the specific packet format from actual key management packets and 

- decapsulating the data packets encapsulated into the specific packet format. 

According to a fifth aspect of the invention there is provided a method for securely 
communicating packets between a first computer device and a second computer 
device through a packet-switched data transmission network comprising 
intermediate computer devices, where at least one of said computer devices 
performs a network address translation and/or a protocol conversion and where a 
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security protocol exists comprising a key management connection, the method 
comprising the steps of 

- for detennining what network address translations, if any, occur on packets 
transmitted between the first computer device and the second computer device: 

5 establishing a key management connection according to said security protocol 
between the first computer device and the second computer device; composing an 
indicator packet with a header part and a payload part of which both comprise the 
network addresses of the first computer device and the second computer device as 
seen by the node composing said packet; transmitting and receiving said indicator 

10 packet within the key management connection; and comparing in the received 
indicator packet the addresses contained in the header part and the payload part, and 

- using the information concerning the determined occurrences of network address 
translations to securely communicating packets between the first computer device 
and the second computer device. 

15 According to a sixth aspect of the invention there is provided a method for securely 
communicating packets between a first computer device and a second computer 
device through a packet-switched data transmission network comprising 
intermediate computer devices, where at least one of said computer devices 
performs a network address translation and/or a protocol conversion; where a 

20 security protocol is acknowledged which determines transport-mode processing of 
packets for transmission and reception; and where a high-level protocol checksum 
has been determined for checking the integrity of received packets, the method 
comprising the steps of 

- at the first computer device, performing transport-mode processing for packets to 
25 be transmitted to the second computer device, 

- at the second computer device, performing transport-mode processing for packets 
received from the first computer device, said transport-mode processing comprising 
the decapsulation of received packets and 

- at the second computer device, updating the high-level protocol checksum for 
30 decapsulated packets for compensating for changes, if any, caused by network 

address translations. 

According to a seventh aspect of the invention there is provided a method for 
maintaining the unchanged form of address translations performed by network 
address translation devices on encapsulated data transmission packets 
35 communicated between a first computer device and a second computer device 
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through a packet-switched data transmission network, the method comprising the 
steps of 

- determining which address translations occur on actual data packets transmitted 
with certain address information between the first computer device and the second 

5 computer device through the packet-switched data transmission network and 

- forcing at least one of the first computer device and the second computer device to 
transmit to the other computer device keepalive packets with address information 
identical to that of actual data packets at a high enough frequency so that network 
address translation devices constantly reuse the mappings used for network address 

10 translation even when a certain fraction of the packets communicated between the 
first computer device and the second computer device are lost in the network. 

Fig. la illustrates the known use of a host NAT, 
Fig. lb illustrates the known use of a port NAT, 

Fig. lc illustrates a known communication connection between nodes through a 

packet-switched network, 
Fig. 2a illustrates a certain Vendor ID payload applicable within the context of 
the invention, 

Fig. 2b illustrates a certain private payload applicable within the context of the 
invention, 

Fig. 2c illustrates a certain combined header structure applicable within the 

context of the invention, 
Fig. 3 illustrates certain method steps related to the application of the invention, 
Fig. 4 illustrates a transformation of header structures according to an aspect of 
the invention, and 

Fig. 5 illustrates a simplified block diagram of a network device used to 
implement the method according to the invention. 

The present invention combines and extends some of the methods of network 
address translation, tunneling over UDP, IKE, and the IKE extension mechanisms, 
in a novel and inventive way to produce a method for secure communications across 
30 network address translations and protocol conversions. The method can be made 
fully automatic and transparent to the user. 

A key point relating to the apphcability of the invention is that - at the priority date 
of the present patent application - in general only TCP (described in RFC793) and 
UDP (described in RFC768) work over NAT. This is because most NATs used in 
35 practise are port NATs, and this is the form of NAT that provides most benefits with 
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regards to the shortage of globally routable IP addresses. The invention is not, 
however, limited to the use of UDP and TCP as they are known at the priority date 
of this patent application: in general it may be said that UDP and TCP are examples 
of protocols that determine that connection identification information (i.e. 
5 addressing and port numbering) that is mapped into another form in the address 
transformation process. We may expect that other kinds of communication protocols 
and address transformations emerge in the future. 

The various aspects of the invention are related to 

- determining whether a remote host supports a certain method which is typically a 
10 secure communication method according to the invention (the "methods supported" 

aspect), 

- determining what network address translations and/or protocol conversions occur 
on packets, if any (the "occurring translations" aspect), 

- tunneling packets inside a certain carefully selected protocol, typically UDP, to 
15 make them traverse NATs (the "selected tunnelring" aspect), 

- using a keepalive method to make sure that involved NAT devices and other 
devices that use timeouts for mappings do not lose the mapping for the 
communicating hosts (the "keepalive" aspect), 

- compensating for the translations that occur before verifying the message 
20 authentication code for AH packets (the "compensation/authentication" aspect) and 

- performing address translations at either the sending or receiving node to 
compensate for multiple hosts being mapped to a single public address (the 
"compensation/mapping" aspect). 

The process of encapsulating data packets for transmission over a different logical 
25 network is called tunneling. Typically, in the case of the IP protocol, tunneling 
involves adding a new IP header in front of the original packet, setting the protocol 
field in the new header appropriately, and sending the packet to the desired 
destination (endpoint of the tunnel). Tunneling may also be implemented by 
modifying the original packet header fields or replacing them with a different 
30 header, as long as a sufficient amount of information about the original packet is 
saved in the process so that it will be possible to reconstruct the packet at the end of 
the tunnel into a form sufficiently similar to the original packet entering the tunnel. 
The exact amount of information that needs to be passed with the packet depends on 
the network protocols, and information may be passed either explicitly (as part of 
35 the tunnelled packet) or implicitly (by the context, as determined e.g. by previously 
transmitted packets or a context identifier in the tunneled packet). 
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It is well known in the art how to tunnel packets over a network. At least the 
references given as RFC 1226, RFC 1234, RFC 1241, RFC 1326, RFC 1701, 
RFC1853, RFC2003, RFC2004, RFC2107, RFC2344, RFC2401, RFC2406, 
RFC2473 and RFC2529 relate to the subject of tunneling. For example, RFC 1234 

5 presents a method of tunneling IPX frames over UDP. In that method, packets are 
tunneled to a fixed UDP port and to the decapsulator's IP address. 

The IPSEC protocol mentioned in the background description typically uses the 
Internet Key Exchange or IKE protocol (known from references RFC2409, 
RFC2408 and RFC2407) for authenticating the communicating parties to each 

10 other, deriving a shared secret known only to the communicating parties, negotiating 
authentication and encryption methods to be used for the communication, and 
agreeing on a security parameter index (SPI) value and a set of selectors to be used 
for the communication. The IKE protocol was previously known as the 
ISAKMP/Oakley, where the acronym ISAKMP comes from Internet Security 

15 Association Key Management Protocol. Besides said normal negotiation specified 
in the IKE standard, IKE supports certain mechanisms for extension. The Vendor 
ID payload known from reference RFC2408 allows communicating parties to 
determine whether the other party supports a particular private extension 
mechanism. The IPSEC DOI (Domain of Interpretation) known as RFC2407 

20 reserves certain numeric values for such private extensions. 

Currently, the well-known Vendor ID payload is defined to have the format 
illustrated in Fig. 2a, where the column numbers correspond to bit positions. For the 
purposes of the present invention the Vendor ID field 201 is the most important part 
of the Vendor ID payload. In the context of the IKE protocol, negotiating whether 
25 the remote host supports a certain method for providing secure network 
communications can be performed as follows. The terminology used here is 
borrowed from the IKE documents. 

The IKE protocol determines the so-called Phase 1 of the mutual exchange of 
messages between the Initiator (i.e., the node first sending a packet to the other) and 
30 the Responder (i.e., the node first receiving a packet). Fig. 3 illustrates an exchange 
of first Phase 1 messages between the Initiator and the Responder. According to the 
"methods supported" aspect of the invention both devices include a certain Vendor 
ID Payload in a certain Phase 1 message which is most advantageously their first 
Phase 1 message. This payload indicates that they support the method in question. 
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In Fig. 3 the Vendor ED fields contained within the Initiator's first (or other) Phase 1 
message is schematically shown as 201' and the Vendor ID fields contained within 
the Responder's first (or other) Phase 1 message is schematically shown as 201". 
To indicate support for a certain method the Vendor ID field in the Vendor ID 
5 Payload is basically an identification of that method: advantageously it is the MD5 
hash of a previously known identification string, e.g. "SSH EPSEC NAT Traversal 
Version 1", without any trailing zeroes or newlines. Producing MD5 hashes of 
arbitrary character sequences is a technique well known in the art for example from 
the publication RFC 1321 mentioned in the list of references. 

10 Next we will address the "occurring translations" aspect of the invention. In 
addition to the above-mentioned Phase 1, the IKE protocol determines the so-called 
Phase 2 of the mutual exchange of messages between the Initiator and the 
Responder. According to the "occurring translations" aspect of the invention the 
parties can determine which translations occur by including the IP addresses they 

15 see in private payloads of certain Phase 2 Quick Mode messages, which are most 
advantageously their first Phase 2 Quick Mode messages. Any unused number in the 
private payload number range can be used to signify such use of the private payload 
(e.g. 157, which is unused at the priority date of the present patent application). 

The private payload used to reveal the occurring translations can have e.g. the 
20 format illustrated in Fig. 2b. Field 211 contains a type code that identifies the types 
of the addresses that appear in fields 212 and 213. Field 212 contains the address of 
the Initiator as seen by the node sending the message, and field 213 contains the 
address of the Responder as seen by the node sending the message. Fig. 3 shows the 
exchange of (first) Phase 2 Quick Mode messages between the Initiator and the 
25 Responder so that the corresponding fields 211', 212' and 213' are included in the 
message sent by the former and the fields 211", 212" and 213" are included in the 
message sent by the latter. 

According to known practice the addresses of the Initiator and Responder are also 
included in the header of the packet that contains the payload of Fig. 2b. In the 

30 header they are susceptible to address translations and other processing whereas in 
the private payload they are not. When the packet with the payload of Fig. 2b is 
received, the addresses contained in it are compared with those seen in the packet 
header. If they differ, then an address translation occurred on the packet. Later we 
will refer to the use of the standard IKE port number 500 together with applying the 

35 invention; as an additional way of detecting occurred translations the port numbers 
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of the received packet can also be compared against the standard IKE port number 
500 to determine if port translations occurred. 

An aspect of some importance when handling the addresses is that the UDP source 
port of the packet can be saved for later use. It would usually be saved with the data 
5 structures for Phase 1 ISAKMP security associations, and would be used to set up 
compensation processing for Phase 2 IPSEC security associations. 

To use the method described above to implement the "occurred translations" aspect 
of the invention, the hosts must modify their Phase 2 identification payloads: the 
payload illustrated in Fig. 2b is not known in the existing standards. One possibility 
10 is to restrict the payloads to the ID_IPV4_ADDR and ID_IPV6_ADDR types, 
which would be appropriate for host-to-host operation. 

Next we will address the "selected tunnelling", "compensation/authentication" and 
"compensation/mapping" aspects of the invention. According to this aspect of the 
invention the actual data packets can be tunneled over the same connection which is 

15 used to set up the security features of the communication connection, e.g. the UDP 
connection used for IKE. This ensures that the actual data packets will experience 
the same translations as the IKE packets did when the translation was determined. 
Taken that the standard port number 500 has been determined for IKE, this would 
mean that all packets are sent with source port 500 and destination port 500, and a 

20 method is needed to distinguish the real IKE packets from those containing 
encapsulated data. One possible way of doing this takes advantage of the fact that 
the IKE header used for real IKE packets contains an Initiator Cookie field: we may 
specify that Initiators that support this aspect of the invention never generate 
cookies that have all zeroes in their four first bytes. The value zero in the 

25 corresponding four bytes is then used to recognize the packet as a tunneled data 
packet. In this way, tunneled data packets would have four zero bytes at the 
begirming of the UDP payload, whereas real IKE packets never would. 

Fig. 4 illustrates the encapsulation of actual IPSEC packets into UDP for 
transmission. Basically, a UDP header 403 and a short intermediate header 404 are 
30 inserted after the IP header 401 already in the packet (with the protocol field copied 
to the intermediate header). The IP header 401 is slightly modified to produce a 
modified IP header 401'. The IP payload 402 stays the same. The simple illustration 
of the unencapsulated IPSEC packet on the left should not be misinterpreted: this 
packet is not plaintext but has been processed according to AH or ESP or 
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corresponding other transformation protocol in the sending node before its 
encapsulation into UDP. 

Without limiting the generality, it is assumed in the presentation here that the 
encapsulation according to Fig. 4 is always performed by the same nodes that 
5 perform IPSEC processing (either an end node or a VPN device). It should also be 
noted that instead of encapsulating the IPSEC packets into UDP they could be 
encapsulated into TCP. This alternative would probably require using fake session 
starts and ends so that the first packet has the SYN bit and the last packet has the 
FIN bit, as specified in the TCP protocol. 

10 In encapsulating an actual data packet or a "datagram" according to Fig. 4, the 
original IP header 401 - defined in RFC791 - is modified to produce the modified 
IP header 401' as follows: 

* the Protocol field in the IP header (not separately shown) is replaced by protocol 
17 for UDP in accordance with RFC768, 

15 * the Total Length field in the IP header (not separately shown) is incremented by 
the combined size of the UDP and intermediate headers (total 16 bytes) and 

* the Header Checksum field in the IP header (not separately shown) is recomputed 
in accordance with the rules given in RFC791. 

As seen from Fig. 4, an UDP header 403 - as defined in RFC768 - and an 
20 intermediate header 404 are inserted after the IP header. The UDP header is 8 octets 
and the intermediate header is 8 octets, for a total of 16 octets. These headers are 
treated as one in the following discussion. The combined header has most 
advantageously the format illustrated in Fig. 2c. Fields of this header are set as 
follows: 

25 * The Source Port field 221 is set to 500 (same as IKE). If the packet goes through 
NAT, this may be different when the packet is received. 

* The Destination Port field 222 is set to the port number from which the other end 
appears to be sending packets. If the packet goes through NAT, the recipient may 
see a different port number here. 

30 * The UDP Length field 223 is the length of the UDP header plus the length of the 
UDP data field. In this case, it also includes the intermediate header. The value is 
computed in bytes as 16 plus the length of the original IP packet payload (not 
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including the original IP header, which is included in the Length field in the IP 
header). 

* The UDP Checksum field 224 is most advantageously set to 0. The UDP 
checksum is optional, and we do not wish to calculate or check it with this tunneling 
mechanism. Integrity of the data is assumed to be protected by an AH or ESP header 
within the tunneled packet. 

* The Must be zero field 225: This field must contain a previously agreed fixed 
value, which is most advantageously all zeroes. The field overlaps with the first four 
bytes of the Initiator Cookie field in an actual IKE header. Any Initiator that 
supports this aspect of the invention must not use a cookie where the first four bytes 
are zero. These zero bytes are used to separate the tunneled packets from real 
ISAKMP packets. Naturally some other fixed value than "all zeroes" could be 
chosen, but the value must be fixed for this particular use. 

* Protocol field 226: The value of this field is copied from the known Protocol field 
in the original IP header (not separately shown in Fig. 4). 

* Reserved field 227: most advantageously sent as all zeroes; ignored on reception. 

The sender inserts this header in any packets tunneled to a destination behind NAT. 
Information about whether NAT is used can be stored on a per SA (Security 
Association) basis in the policy manager. The encapsulation referred to in Fig. 4 can 
be implemented either as a new transform or as part of the otherwise known AH and 
ESP transforms. 

The encapsulation operation makes use of the UDP port number and IP address of 
the remote host, which were determined during the IKE negotiation. 

The receiver decapsulates packets from this encapsulation before doing AH or ESP 
processing. Decapsulation removes this header and updates the Protocol, Length, 
and Checksum fields of the IP header. No configuration data (port number etc.) is 
needed for this operation. 

The decapsulation should be performed only if all of the following selectors match: 

* destination address is the destination address of this host, 

* source address is the address of a host with which this host has agreed to use this 
tunnelling, 
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* the Protocol field indicates UDP, 

* the Destination port field value is 500 and 

* the Source port field value indicates the port with which this host has agreed to 
use this tunneling. (Note that there may be multiple source addresses and ports for 

5 which this tunneling is performed; each of them is treated by a separate set of 
selectors.) 

During decapsulation the source address in the received packet can be replaced by 
the real source address received during the IKE negotiation. This implements the 
compensation for AH MAC verification. The address is again changed in the post- 
10 processing phase below. Because of this compensation, the standard AH and ESP 
transforms can be used unmodified. 

In Fig. 3 the AH/ESP processing at the sending node is schematically shown as 
block 301, encapsulation of datagrams into UDP is schematically shown as block 
302, the corresponding decapsulation of datagrams from UDP is schematically 
15 shown as block 303 and AH/ESP processing at the receiving node is schematically 
shown as block 304. 

Additional compensation must be done after the packet has been decapsulated from 
AH or ESP. This additional decapsulation must deal with the fact that the outer 
packet actually went through NAT (illustrated schematically in Fig. 3 as block 305), 
20 and consequently the plaintext packet must also undergo a similar transformation. 
The recipient must see the address of the NAT device as the address of the host, 
rather than the original internal address. Alternatively, this compensation could have 
been performed by the sender of the packet before encapsulating it within AH or 
ESP. 

25 There are several alternatives for this additional compensation for various special 
cases (the best compensation depends on the particular application): 

* Allocating a range of network addresses for this processing (say, in the link-local 
use range 169.254.x.x - the actual values do not matter; basically we just want an 
arbitrary network that no-one else is using). An address in this range is allocated for 

30 each <natip, ownip, natport, ownport> combination, where natip means the IP 
address of the NAT, ownip means the processing device's own IP address, natport 
means the port number at the NAT and ownport means the processing device's own 
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port number. The remote address in the packet is replaced by this address before the 
packet is sent to protocol stacks. 

* As part of the compensation, the TCP checksum for internal hosts must be 
recomputed if host addresses or port numbers changed. TCP checksum 
computations may also be incremental, as is known from RFC 1071. Port NAT may 
need to be performed for the source port 

* When used as a VPN between two sites using incompatible (possibly overlapping) 
private address spaces, address translation must be performed to make the addresses 
compatible with local addresses. 

* When used as a VPN between two sites using compatible (non-overlapping) 
private address spaces, and tunnel mode is used, no additional compensation may be 
needed. 

* Address translation may need to be performed for the contents of certain protocol 
packets, such as FTP (known from RFC959) or H.323. Other similar issues are 
discussed in the reference given as HoldregeSrisuresh99. 

* It may also be possible to use random addresses for the client at the server, and 
perform address translation to this address. This could allow the server to 
distinguish between multiple clients behind the same NAT, and could avoid manual 
configuration of the local address space. 

* The compensation operation may or may not interact with the TCP/IP stack on the 
local machine to reserve UDP port numbers. 

In general, this invention does not significantly constrain the method used to 
compensate for inner packets the NAT occurring for the outer header. The optimal 
method for performing such compensation may be found among the above-given 
alternatives by experimenting, or some other optimal method could be presented. 

Next we will address the "keepalive" aspect of the invention, i.e. ensuring that the 
network address translations performed in the network do not change after the 
translations that occur have been determined. Network address translators cache the 
information about address mapping, so that they can reverse the mapping for reply 
packets. If TCP is used, the address translator may look at the FIN bit of the TCP 
header to determine when it can drop a particular mapping. For UDP, however, 
there is no explicit termination indication for flows. For this reason, many NATs 
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will time out mappings for UDP quite fast (even as fast as in 30 seconds). Thus, it 
becomes necessary to force the mapping to be maintained. 

A possible way of ensuring the mamtaining of mappings is to send keepalive 
packets frequently enough that the address translation remains in the cache. When 

5 computing the required frequency, one must take into account that packets may be 
lost in the network, and thus multiple keepalives must be sent within the estimated 
shortest period in which NATs may forget the mapping. The appropriate frequency 
depends on both the period the mappings are kept cached and on the packet loss 
probability of the network; optimal frequency values for various context may be 

10 found through experimenting. 

Keepalive packets do not need to contain any meaningful information other than the 
necessary headers that are equal to the data packet headers to ensure that the 
keepalive packets will be handled exactly in the same way as the actual data 
packets. A keepalive packet may contain an indicator that identifies it as a keepalive 

15 packet and not a data packet; however it may also be determined that all packets that 
do not contain meaningful payload information are interpreted to be keepalive 
packets. In Fig. 3 the transmission of keepalive packets is schematically illustrated 
by block 306 and the reception and discarding of them is schematically illustrated 
by block 307. It should be noted that the use of keepalive packets is not needed at 

20 all if actual data packets are transmitted frequently enough and/or the connection is 
to remain valid only for such a short time (e.g. a few seconds) that it is improbable 
that any intermediate device would delete the mapping information from its cache. 
Keepalive packets need to be transmitted in one direction only, although they may 
be transmitted also bidirectionally; the drawback resulting from their bidirectional 

25 transmission is the resulting increase in unnecessary network traffic. The invention 
does not limit the direction(s) in which keepalive packets (if any) are transmitted. 

Fig. 5 is a simplified block diagram of a network device 500 that can act as the 
Initiator or the Responder according to the method of providing secure 
communications over network address translations in accordance with the invention. 

30 Network interface 501 connects the network device 500 physically to the network. 
Address management block 502 keeps track of the correct network addresses, port 
numbers and other essential public identification information of both the network 
device 500 itself and its peer (not shown). IKE block 503 is responsible for the key 
management process and other activities related to the exchange of secret 

35 information. Encryption/decryption block 504 implements the encryption and 
decryption of data once the secret key has been obtained by the IKE block 503. 
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Compensation block 505 is used to compensate for the permissible transformations 
in the transmitted and/or received packets according to the invention. Either one of 
blocks 504 and 505 may be used to transmit, receive and discard keepalive packets. 
Packet assembler/disassembler block 506 is the intermediator between blocks 502 to 

5 505 and the physical network interface 501. All blocks operate under the super- 
vision of a control block 507 which also takes care of the routing of information 
between the other blocks and the rest of the network device, for example for 
displaying information to the user through a display unit (not shown) and obtaining 
commands from the user through a keyboard (not shown). The blocks of Fig. 5 are 

10 most advantageously implemented as pre-programmed operational procedures of a 
microprocessor, which implementation is known as such to the person skilled in the 
art. Other arrangements than that shown in Fig. 5 may as well be used to reduce the 
invention into practice. 

Even though the present invention was presented in the context of IKE, and 
15 tunneling using the IKE port, it should be understood that the invention applies to 
also other analogous cases using different packet formatting methods, different 
negotiation details, a different key exchange protocol, or a different security 
protocol. The invention may also be applicable to non-IP protocols with suitable 
characteristics. The invention is equally applicable to both IPv4 and IPv6 protocols. 
20 The invention is also intended to apply to future revisions of the IPSEC and IKE 
protocols. 

It should also be understood that the invention can also be applied to protocol 
translations in addition to just address translations. Adapting the present invention 
to protocol translations should be well within the capabilities of a person skilled in 
25 the art given the description here and the discussions regarding protocol translation 
in the former patent applications of the same applicant mentioned above and 
incorporated herein by reference. 
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Claims 

1 . A method for securely communicating packets between a first computer device 
(181, INITIATOR) and a second computer device (185, RESPONDER) through a 
packet-switched data transmission network (184) comprising intermediate computer 

5 devices (183, 187, 305), where at least one of said computer devices performs a 
network address translation and/or a protocol conversion, characterised in that the 
method comprises the steps of 

- detennining what network address translations, if any, occur on packets 
transmitted between the first computer device and the second computer device, 

10 - taking (301) packets conforming to a first protocol and encapsulating (302) them 
into packets confonning to a second protocol, which second protocol is capable of 
traversing network address translations, 

- transmitting said packets confonning to said second protocol from the first 
computer device to the second computer device and 

15 - decapsulating (303) said transmitted packets conforming to said second protocol 
into packets conforming (304) to said first protocol. 

2. A method according to claim 1, characterised in that the step of taking (301) 
packets conforming to a first protocol and encapsulating (302) them into packets 
conforming to a second protocol comprises the substeps of 

20 - taking packets conforming to the Internet Protocol (40 1, 402), 

- processing said packets according to the IPSEC protocol suite and 

- encapsulating the processed packets into packets conforming to the User Datagram 
Protocol (401', 403,404). 

3. A method according to claim 1, characterised in that the step of taking (301) 
25 packets conforming to a first protocol and encapsulating (302) them into packets 

conforming to a second protocol comprises the substeps of 

- taking packets conforming to the Internet Protocol, 

- processing said packets according to the IPSEC protocol suite and 

- encapsulating the processed packets into packets conforming to the Transmission 
30 Control Protocol. 

4. A method according to claim 1, characterised in that it further comprises the 
step of compensating for the network address translations on said second protocol in 



WO 00/78008 



PCT/FI00/00537 



24 

the packets that are transmitted from the first computer device to the second 
computer device. 

5. A method according to claim 4, characterised in that said step of 
compensating for the network address translations comprises a step of performing 
address translation based on the information obtained in the step of detennining 
what network address translations, if any, occur on packets transmitted between the 
first computer device and the second computer device. 

6. A method according to claim 5, characterised in that said step of 
compensating for the network address translations further comprises a step of 
performing port number translation based on the information obtained in the step of 
detennining what network address translations, if any, occur on packets transmitted 
between the first computer device and the second computer device. 

7. A method according to claim 1, characterised in that it additionally comprises 
the step of periodically transmitting (306, 307) keepalive packets between the first 
computer device and the second computer device to ensure that the network address 
translations, if any, occurring on packets transmitted between the first computer 
device and the second computer device stay the same. 

8. A method for conditionally setting up a secure communication connection 
between a first computer device (181, INITIATOR) and a second computer device 
(185, RESPONDER) through a packet-switched data transmission network (184) 
comprising intermediate computer devices (183, 187, 305), where at least one of 
said computer devices performs a network address translation and/or a protocol 
conversion, characterised in that the method comprises the steps of 

-finding out (201', 201'), whether or not the second computer device supports a 
communication method where: it is determined what network address translations, if 
any, occur on packets transmitted between the first computer device and the second 
computer device; packets are taken that conform to a first protocol and encapsulated 
into packets that conform to a second protocol, which second protocol is capable of 
traversing network address translations; said packets conforaiing to said second 
protocol are transmitted from the first computer device to the second computer 
device; and said transmitted packets conforming to said second protocol are 
decapsulated into packets conforming to said first protocol, 

- as a response to a finding indicating that the second computer device supports said 
communication method, setting up a secure communication connection between the 
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first computer device and the second computer device in which communication 
connection said communication method is employed and 

- as a response to a finding indicating that the second computer device does not 
support said communication method, disabling the use of said communication 

5 method between the first and the second computer devices. 

9. A method for tunnelling packets between a first computer device (181, 
INITIATOR) and a second computer device (185, RESPONDER) through a packet- 
switched data transmission network (184) comprising intermediate computer 
devices (183, 187, 305), where at least one of said computer devices performs a 
10 network address translation and/or a protocol conversion, characterised in that the 
method comprises the steps of 

- establishing a bidirectional taimelling mode between the first computer device and 
the second computer device by exchanging packets conforming to a secure 
communication protocol, 

15 - taking (301) packets conforming to a first protocol and encapsulating (302) them at 
the first computer device into packets conforming to a second protocol, which 
second protocol is capable of traversing network address translations, 

- transmitting said packets conforming to said second protocol from the first 
computer device to the second computer device, 

20 - decapsulating (303) said transmitted packets conforming to said second protocol 
into packets conforming (304) to said first protocol at the second computer device, 

- obtaining information about the address translations occurred on packets 
transmitted between the first computer device and the second computer device and 

- using said obtained information to modify the established bidirectional tunnelling 
25 mode between the first computer device and the second computer device. 

10. A method according to claim 9, characterised in that the step of obtaining 
information about the address translations occurred on packets transmitted between 
the first computer device and the second computer device comprises the substeps of 

- transmitting a packet between the first computer device and the second computer 
30 device, said packet comprising a header part and a payload part, and 

- comparing a network address transmitted in said payload part to a network address 
transmitted in said header part in order to find out what changes have occurred on 
said network address transmitted in said header part. 
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11. A method according to claim 9, characterised in that it additionally comprises 
the step of periodically transmitting (306, 307) keepalive packets between the first 
computer device and the second computer device to ensure that the network address 
translations, if any, occurring on packets transmitted between the first computer 

5 device and the second computer device stay the same. 

12. A method according to claim 9, characterised in that the step of using said 
obtained information to modify the operation of the tunnelling of packets comprises 
the substep of introducing an address translation before the encapsulation (302) of 
packets in order to compensate for the network address translations that occur on 

10 packets transmitted between the first computer device and the second computer 
device. 

13. A method according to claim 9, characterised in that the step of using said 
obtained information to modify the operation of the tunnelling of packets comprises 
the substep of introducing an address translation after the decapsulation (303) of 

15 packets in order to compensate for the network address translations that occur on 
packets transmitted between the first computer device and the second computer 
device. 

14. A method for tunnelling packets between a first computer device (181, 
INITIATOR) and a second computer device (185, RESPONDER) through a packet- 

20 switched data transmission network (184) comprising intermediate computer 
devices (183, 187, 305), in which data transmission network there exists a security 
protocol comprising a key management connection that employs a specific packet 
format for key management packets, characterised in that the method comprises the 
steps of 

25 - encapsulating data packets that are not key management packets into said specific 
packet format for key management packets, 

- transmitting said data packets encapsulated into the specific packet format from 
the first computer device to the second computer device, 

- discriimnating at the second computer device the data packets encapsulated into 
30 the specific packet format from actual key management packets and 

- decapsulating the data packets encapsulated into the specific packet format. 

15. A method according to claim 14, characterised in that the step of 
encapsulating data packets that are not key management packets comprises the 
substeps of 
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- encapsulating data packets that are not key management packets into a key 
management packet format specified by the Internet Key Exchange protocol which 
defines a certain Initiator Cookie field and 

- inserting into the Initiator Cookie field of an encapsulated data packet a value 
5 indicating that the encapsulated packet is a data packet and not a key management 

packet. 

16. A method for securely communicating packets between a first computer device 
(181, INITIATOR) and a second computer device (185, RESPONDER) through a 
packet-switched data transmission network (184) comprising intermediate computer 

10 devices (183, 187, 305), where at least one of said computer devices performs a 
network address translation and/or a protocol conversion and where a security 
protocol exists comprising a key management connection, characterised in that the 
method comprises the steps of 

- for determining what network address translations, if any, occur on packets 
15 transmitted between the first computer device and the second computer device: 

estabUshing a key management connection according to said security protocol 
between the first computer device and the second computer device; composing an 
indicator packet with a header part and a payload part of which both comprise the 
network addresses of the first computer device and the second computer device as 
20 seen by the node composing said packet; transmitting and receiving said indicator 
packet within the key management connection; and comparing in the received 
indicator packet the addresses contained in the header part and the payload part, and 

- using the information concerning the determined occurrences of network address 
translations to securely communicating packets between the first computer device 

25 and the second computer device. 

17. A method according to claim 16, characterised in that the security protocol 
determines a standard port number for a key management connection, and the 
method further comprises the step of comparing in the received indicator packet a 
source port number against said standard port number for a key management 

30 connection. 

18. A method for securely communicating packets between a first computer device 
(181, INITIATOR) and a second computer device (185, RESPONDER) through a 
packet-switched data transmission network (184) comprising intermediate computer 
devices (183, 187, 305), where at least one of said computer devices performs a 

35 network address translation and/or a protocol conversion; where a security protocol 
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is acknowledged which determines transport-mode processing of packets for 
transmission and reception; and where a high-level protocol checksum has been 
determined for checking the integrity of received packets, characterised in that the 
method comprises the steps of 

- at the first computer device, performing transport-mode processing for packets to 
be transmitted to the second computer device, 

- at the second computer device, performing transport-mode processing for packets 
received from the first computer device, said transport-mode processing comprising 
the decapsulation of received packets and 

- at the second computer device, updating the high-level protocol checksum for 
decapsulated packets for compensating for changes, if any, caused by network 
address translations. 

19. A method according to claim 18, characterised in that 

- the step of performing transport-mode processing at the first computer device for 
packets transmitted to the second computer device takes the form of performing 
transport-mode processing as determined in the IPSEC protocol suite, and 

- the step of performing transport-mode processing at the second computer device 
for packets received from the first computer device takes the form of performing 
transport-mode processing as determined in the IPSEC protocol suite. 

20. A method according to claim 18, characterised in that it additionally 
comprises the steps of 

- at the first computer device, after performing transport-mode processing for a 
packet to be transmitted to the second computer device, encapsulating the processed 
packet into a packet conforming to a certain second protocol, which second protocol 
is capable of traversing network address translations and 

- at the second computer device, before performing transport-mode processing for a 
packet received from the first computer device, decapsulating the received packet 
from the packet conforming to said second protocol and replacing a number of 
network addresses in the decapsulated packet with a corresponding number of 
network addresses taken from the received packet before decapsulation. 

21. A method according to claim 18, characterised in that the step of updating the 
high-level protocol checksum takes the form of recomputing the checksum for the 
transport-mode-processed packets. 
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22. A method according to claim 18, characterised in that the method additionally 
comprises the step of obtaining information about the network addresses of the first 
and second computer devices before and after network address translations, and the 
step of updating the high-level protocol checksum takes the form of incrementally 

5 updating the checksum based on the obtained information about the network 
addresses of the first and second computer devices before and after network address 
translations. 

23. A method for mamtaining the unchanged form of address translations 
performed by network address translation devices on encapsulated actual data 

10 packets transmitted with certain address information between a first computer 
device (181, INITIATOR) and a second computer device (185, RESPONDER) 
through a packet-switched data transmission network (184), characterised in that 
the method comprises the step of 

- forcing at least one of the first computer device and the second computer device to 
15 transmit (306) to the other computer device keepalive packets with address 
information identical to that of actual data packets at a high enough frequency so 
that network address translation devices constantly reuse the mappings used for 
network address translation (305) even when a certain fraction of the packets 
communicated between the first computer device and the second computer device 
20 are lost in the network. 
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